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Abstract 

In this paper we will study uniformly distributed sequences of par- 
titions of [0, 1], a concept which has been introduced by Kakutani in 
1976. We will construct new families of u.d. sequences of partitions 
and study relations with the classical concept of uniformly distributed 
sequences of points. 

AMS classification: 11K06, 11K45, 28D05, 60B10, 60F15, 60F20 



1 Introduction 

If we split the interval [0, 1] in two equal parts, and then in four equal parts 
and keep going, splitting it in 2 n equal parts at the n-th step, we get the 
sequence of dyadic (or binary) partitions which are finer and finer. The 
procedure is, in an obvious sense, uniform. 

If on the other hand we take any a g]0, 1[ different from one half and split 
[0,1] in [0, at] and [at, 1], and follow a similar procedure as before, splitting 
each interval of the previous partition in two parts, left and right, propor- 
tional to a and (5 = 1 — a respectively, then we get the so-called a-dyadic 
partitions, whose behavior depends of course on a. An obvious observation is 
that the points of the partitions are now not evenly distributed, since [0, a] 
and [a, 1] contain the same number of points of any subsequent a-dyadic 
partition, in spite of the fact that they do not have the same measure. 

In [15] Kakutani introduced the following interesting variant of the split- 
ting procedure just described. 

Definition 1.1 If a e]0, 1[ and n = {[£,_!, £j] : 1 < % < k} is any partition of 
[0, 1], its a-refinement (which will be denoted by an) is obtained by subdivid- 
ing only the interval(s) ofir having maximal length in two parts, proportional 
to a and /3 — 1 — a, respectively. 

Kakutani's sequence of partitions {n n } is obtained by successive a-refme- 
ments of the trivial partition u> = {[0, 1]}. 
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For example, if a < (3, K\ — {[0, a], [a, 1]}, k 2 = {[0, a], [a, a + a/3], [a + 
a/3, 1]}, and so on. 

Note that for a = \ we get just the sequence of dyadic partitions. 

To express things with the appropriate precision, we need the following 
definition. 

Definition 1.2 Given a sequence of partitions {u: n } of [0, 1], with 

*n = {[*?_!,*?] :l<i<fc(n)}, 

we say that it is uniformly distributed (u.d.), if for any continuous function 
/ on [0, 1] we have 

A simple but useful observation is that uniform distribution of a sequence 
of partitions {n n } is equivalent to the weak convergence to A (the Lebesgue 
measure on [0, 1]) of the associated sequence of measures {u n }, with 

I fc(n) 

where 5 t denotes the Dirac measure concentrated in t. 

This reformulation allows to use classical compactness criteria for weak 
convergence (see [19]). 

We will use in the sequel another standard argument: a sequence of 
partitions is u.d. if and only if (1) holds for a family of Riemann integrable 
functions Q such that its linear span is dense in the class of the continuous 
functions on [0, 1] with respect to the || • ||oo norm ([16], Chapter 1). 

We can state now Kakutani's main result. 

Theorem 1.3 For any a e]0, 1[ the sequence of partitions {k h } is uniformly 
distributed. 

This fact got a considerable attention in the late seventies and early eight- 
ies, when other authors provided different proofs of Kakutani's theorem [1] 
and also proved several stochastic versions, in which the intervals of maximal 
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length are split according to certain probability distributions ([17], [18], [25], 
[22], [23] and [4]). The expository work [2] gives a complete overview on the 
results obtained before 1984. 

The paper [8] extends the notion of uniform distribution of a sequence of 
partitions to probability measures on complete separable metric spaces. 

More recently some new questions and ideas revived the interest for the 
subject and few items have been added to the pertinent bibliography ([20], 
[6], [7]). The second of these papers in particular extends Kakutani's split- 
ting procedure to higher dimension with a construction which is intrinsically 
higher-dimensional. Further results are on the way, like for instance the ex- 
tension of the theory to certain fractals and the explicit computation of the 
discrepancy of a particular class of sequences of partitions. 

This paper will contribute further results to this subject. 

In Section 2 we generalize Kakutani's procedure by splitting the longest 
interval into a finite number of parts homothetically to a given partition p 
and prove that {p n uj} is uniformly distributed. 

In Section 3 we associate to a given u.d. sequence of partitions random 
reorderings of the points determining the partitions, proving that almost 
surely they provide uniformly distributed sequences of points. This answers 
a question posed by David Fremlin. 

Section 3 provides an explicit link between our area of investigation and 
the theory of uniformly distributed sequences of points, initiated by Hermann 
Weyl in [26]. We believe our results will have applications in quasi- Monte 
Carlo methods, because new simple u.d. sequences of low discrepancy (to be 
discussed in another paper) can be generated by our methods, and combined 
with the approach initiated by van der Corput (in [24] and seven subsequent 
papers numbered from I to VIII), Halton [10], [11] and Hammersley [12]. 

An excellent and by now classic exposition of results on uniformly dis- 
tributed sequences of points is [16]. For more recent results and applications 
see for instance [9], in particular Chapter 3. 

Acknowledgements The autor wants to thank David Fremlin and Pietro Rigo 
for helpfull discussions. 



3 



2 p-refinements 



In this section we will geneneralize Kakutani's splitting procedure and pro- 
duce a new class of u.d. sequences of partitions. 

Let us give the following definition. 

Definition 2.1 Consider any non trivial finite partition p of [0, 1]. We will 

keep it fixed for the whole section. The p-refinement of a partition n of [0, 1] 
(which will be denoted by pn) is obtained by subdividing all the intervals of 
7r having maximal length positively (or directly) homothetically to p. 

Obviously, if p = {[0,a], [a, 1]}, then the p-refinement is just Kakutani's 
a-refmement. 

As in Kakutani's case, we can iterate the splitting procedure. The p- 
refmement of pn will be denoted by p 2 7r, and the meaning of p n n, for n G IN, 
does not need any explanation. 

In this section we will prove that the sequence {p n uj} of successive p- 
refinements of the trivial partition uj is u.d.. 

Remark 2.2 It is worth noticing that it is necessary to put some restric- 
tion on the partition n (even in the simplest case of the Kakutani splitting 
procedure) if we hope for uniform distribution of {p n 7r}. 

It would be interesting to find significant sufficient conditions on ir in order 
to obtain the uniform distribution of {p n 7r} even for the case of Kakutani's 
splitting procedure. 

The following simple example shows where lies the problem. 

Let 7r = {[0, |], [|, 1]} and consider p = {[0, |], [\, 1]}. It is clear that the 
p-refinement operates alternatively on [|,1] and [0, |], and that the subse- 
quences {p 2n 7r} and {p 2n+1 7r} converge to measures which attribute to [0, |] 
the values | and |, respectively, so {p™7r} does not converge. 

Let us now fix some notations and recall some preliminarily facts which 
will be used to prove Theorem 2.7. 

Let p = Ti\ : 1 < i < k} be the fixed partition of [0, 1]. Denote by 

cti = Ti — ri-i, for 1 < i < k, the lengths of the k intervals of p. 

Let us denote by [p] n the so-called n-th p-adic partition of [0, 1], obtained 
from [p] n_1 (where [p] 1 = p) by subdividing all its k n ~ x intervals positively 
homothetically to p. If an interval belongs to [p] n , we will say that it has 
rank n. 
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The k intervals [YX=i a h,Ysh=i a h], 1 < i < k, of p will be denoted by 
I{cti). If [%-i,%] = I{ a ii a i2 ■ ■ -«in-i)> 1 < J < is a generic interval of 

rank n — 1, the A;™ intervals of [p] n are 

/ = I{a h a i2 . ..a in ^a in ) 

in 1 in 

= + - ^ <*h, S/j-i + - %-i) H • 

7i=l h=l 

Here the ccj's are understood as symbols (following [1]), not as numbers. 
So if for instance cti = a 2 = «3 = |, /(aia^) = [|, |], while I(a 2 ai) = [|, |]. 
Note that A(/(a il a ia . . . a^a^)) = Um=i a w 

Let X = {«!, a 2 , • • • , and let cr be the probability on X such that 
a {{ a i}) = ®i for 1 < i < k. 

Put X m = X and cr m = cr for any m G W. 

Denote by Y the countable product Ilm=i X m . On Y we consider the 
usual product probability p. If C = C{a il a i2 . . .a in ) = Ilm=i ^in where 
= {«j m } for m < n and = X for m > n, is a cylinder set, then 
KC) = \T m=l a lm . 

To every point i e [0,1] we can associate a sequence {a im } such that 

* G n^ =1 /(a il o; i2 . . . oi^oO . 

There is an (expected) ambiguity to be taken care of: there are two such 
sequences {«j m } for the countable set of points belonging to the endpoints 
of some interval of [p] n ; in that case we associate to t the sequence for which 
definitely c*i h = a±. 

This defines a 1 — 1 mapping between [0, 1] and a subset Y' of Y, obtained 
by removing from Y the countable set of sequences {«i m } such that definitely 
a im = a k . Note that p{Y \ Y') = 0. 

This mapping is measure preserving if we take on [0, 1] the Lebesgue 
measure and on Y' the restriction of p. This follows immediately observing 
that cylinder sets C and p-adic intervals / bearing the same indices have the 
same measure. 

If / and J are two disjoint subintervals of [0, 1] having the same length, 
and J = I + c, say, the function fi y j which takes values x + c on J, x — c on 
J and x otherwise, is measure preserving. 
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Let us denote by T the family of all such functions fi t j, where now I and 
J are disjoint p-adic intervals of the same length. The intervals / and J need 
not have the same rank. The next proposition proves an important property 
of T. 

Lemma 2.3 The family T is ergodic. 

Proof. We will exploit the measure isomorphism described above. Denote 
by T' the corresponding family of transformations on Y'. When / = f IyJ 
and / and J do have the same rank, the corresponding function /' on Y' is 
a permutation of a finite number of coordinates and it preserves the product 
measure p,. The family of these transformations is ergodic by a theorem due 
to Hewitt and Savage ([14]). Since it is contained in J 7 ', the latter is also 
ergodic. By the isomorphism between (Y', p) and ([0, 1], A), we conclude that 
T is ergodic, too. 

For the partition p n u, let A n denote the length of the longest interval and 
a n the length of the shortest interval. We have the following estimate. 

Lemma 2.4 For any n, 

a x A n < a n . 

Proof. Since A n < 1, the inequality is true for n = 1. Proceed now by 
induction. Suppose a\A n ^i < a n -\. There are two possibilities: either a n = 
o n _i or a n < a n _i. In the first case, since A n < A n _i for any n, 

a x A n < a x A n _ x < a n _i = a n . 

In the second case the shortest interval of p n u results from the splitting of 
the longest one of p n ^ l uj, in other words a n = a x A n ^i, so the conclusion 
follows in either case. 

Lemma 2.5 If n n = p n oo, then 

lim diam n n = . 

Proof. With the notation introduced above, obvioulsy a n < ^ for any n. 
Apply now the previous lemma to get 

diam ir n = A n < — < . 

a x na\ 
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Remark 2.6 An immediate consequence of the previous lemma is that if 
we consider the family of the characteristic functions (which are Riemann 
integrable) of all the intervals belonging to the partitions p n u, n G IV, its 
linear span is dense in the class of all continuous functions on [0, 1] with 
respect to the || • norm. 

We are now ready for proving the main result of this section. 

Theorem 2.7 The sequence {p n uj} is uniformly distributed. 

Proof. It is well known that the set of all Borel probability measures on 
[0,1], with the topology associated to the weak convergence, is metrizable 
and compact ([19], Theorem 6.4). Let us denote by {u n } the sequence of 
measures associated to the partitions {p n uj} as defined in Section 1. Then 
{v n } admits weakly convergent subsequences. All we have to prove is that 
any such subsequence converges to A. 

The plan of the proof is the following: let v be the limit of a convergent 
subsequence of {v n }. Then Lemmas 2.4 and 2.5 provide lower and upper 
bounds for the Radon-Nikodym derivative ^. The ergodicity of the family 
T tells us finally that this derivative has to be constant almost everywhere 
and therefore v — A. 

Every interval J belonging to some p n u belongs to some [p] m u;, but it is 
also true, viceversa, that every interval I G [p] m u sooner or later appears in 
one of the partitions p n uo. This is due to the fact that 



is well defined, since p-adic intervals are either disjoint or one contains the 
other. Moreover the diameter of p n u tends to zero. This implies that I = J s . 

Let now J be any p-adic interval and suppose that m G IV is such that 
every p n u, for n > m, subdivides J. 

If J is subdivided by p n u into k intervals, then obviously 



where a n and A n are the quantities considered in Lemma 2.4. 

If k(n) is the number of intervals of p n u, the previous double inequality 
can be rewritten, in terms of the measure v n (by Lemma 2.4) as 



s = sup{r : J r G p r u, I C J,} 



ka n < A (J) < kA 



A(J) 
k(n)A 



-n 



< Vn{J) < 




! n 
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for any n > m. It follows that, for all n sufficiently large, 



aiA(J) < 



aiA(J) 
k(n)a n 



k{n)a\A. 



A(J) 



-n 



< 



A(J) 



Suppose now that a subsequence {^ nfc } converges weakly to v. From the 
last inequality we can conclude that for any p-adic interval J it is 



Hence, by Remark 2.6, the same inequality holds for any Borel set B. 

Therefore A « v « A and if we denote by ^ the Radon-Nikodym 
derivative of v with respect to A, we have A-a.e. 



Observe that if / and J are two intervals which have the same length and 
belong to some p n uj (they need not have the same rank), then our splitting 
procedure behaves on them in the same way. This implies that u(I) = u(J). 

Therefore v is invariant with respect to the family of functions JF consid- 
ered in Lemma 2.3, where we proved that it is ergodic. It follows that ^ is 
constant almost everywhere, i.e. v = A. In other words, the sequence {p n w} 
is u.d.. 

3 Associated uniformly distributed sequences 
of points 

In this section we will study the relation between u.d. sequences of partitions 
and u.d. sequences of points. 

Let us recall the following definition. 

Definition 3.1 A sequence of points {x n } in [0, 1] is said to be uniformly 
distributed (u.d.), if 



aiA(J) < u(J) < 



A(J) 





for every continuous function on [0, 1]. 
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Suppose {vr„} is a u.d. sequence of partitions, with n n = {[£"_!,£"] : 1 < 
% < k(n)}. Then it is natural to ask whether the points {t™} determining the 
partition 7i n , for 1 < i < k(n) and n G IV, can be rearranged in order to get 
a u.d. sequence of points. 

There are of course many ways of reordering the points {£"}. A nat- 
ural restriction to make is that we first reorder all the points defining 7Ti, 
then those defining 7:2, and so on. We will limit ourselves to such kind of 
reorderings and call them sequential. But still there is a lot of freedom left 
in rearranging the points for n fixed, and not all of them produce the 
desired result. 

Consider again the dyadic sequence of partitions. Put 1% = [^r, for 
n G IV and h — 1, 2, . . . , 2 n . Let us order, for any fixed n, the left end-points 
of the I^'s by magnitude. We obtain in this way the so-called lexicographic 
order which does not give a u.d. sequence, as it is easy to check. 

It is known from the literature that the (by far) best thing one can do 
is the following (see [16], Theorem 3.5). Any positive integer k can be rep- 
resented uniquely by its dyadic expansion k = J2i=i a i^\ with a, G {0,1}, 
a s = I. 

Set now, for any k > 1, Xk = X^=i&i2 _ * _1 . This is called the van der 
Corput's sequence. It is a well known result that {xk} is u.d. and that it has 
optimal discrepancy (for more information, see also [9], Chapter 1 and also 
the brief discussion in the last section of this paper). 

So the lexicographic order is no good, van der Corput's method is excel- 
lent. What can be said about the generic sequential reordering? We will give 
here a probabilistic answer which applies to any u.d. sequence of partitions. 

First we need a version of the strong law of large numbers for negatively 
correlated random variables. This result is attributed in [4] to Aleksander 
Rajchman and can be proved, with small modifications, along the lines of 
[4], Theorem 5.1.2. 

Lemma 3.2 Let {f n } be a sequence of real, negatively correlated random 
variables with variances uniformly bounded by V on the probability space 
(W,P). Suppose moreover that 

lim E((pi) = M . 

i—>oo 
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Then 



1 n 

lim — ifi = M almost surely . 



We will also need few simple properties of a particular family of discrete 
random variables. Let ip be the random variable taking with probability ^ 
values in the sample space W = {wi G [0, 1], 1 < % < k}, k > 2. We may, 
and do, assume that Wi-i < Wi for 1 < i < k. Denote by ipi the value of 
(p in the i-th draw from W, without replacement. Fix now c g]0, 1[ and let 
ipi = X[o,c[(^i)> where X[o,c[(") denotes the characteristic function of [0,c[. We 
have then the following simple property. 

Proposition 3.3 The variances of the random variables ipi, 1 < i < k, are 
bounded by \ . Moreover, the ipi 's are negatively correlated. 

Let us now prepare the setting for the main result of this section. 
If {7r n } is a sequence of uniformly distributed partitions of [0, 1], where 
n n = {[tf-ijtf] : 1 < i < k(n)}, the sequential random reordering of the 
points t™ is a sequence (denoted by {f m }) made up by consecutive blocks of 
&(1), k(2), . . . , k(n), . . . random variables. The random variables of the n-th 
block have the same law and represent the drawing, without replacement, 
from the sample space W n = {t", t% , . . . , ££( n )}, where each singleton has 
probability tv-t. 



Denote by T n the set of all permutations on W n , endowed with the natural 
probability compatible with the uniform probability on W n : each permuta- 
tion r n G T n has probability ^yy. 

Any sequential random reordering of the sequence {7r n } corresponds to a 
random selection of r n G T n , for each n G IV. The permutation r n identifies 
the reordered /c(n)-tuple of random variables ifii for Kin — 1) < i < K(n), 
where K(n) = YPJi=\k{i). The set of all sequential reorderings can be given 
therefore the natural product probability on T = Y\^ =1 T n . 

Theorem 3.4 If {7r n } is a uniformly distributed sequence of partitions of 
[0, 1], then the sequential random reordering of the points tf defining them is 
almost surely a uniformly distributed sequence of points in [0, 1]. 
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Proof. Observe first that if < c < 1 and <p m belongs to the n-th block of 
k(n) random variables, then 

■y k(n) 

E(X[o,4<Prn)) = E X[o,c[(*?) , 

and this quantity tends to c, when m and hence n tend to infinity, since {7r n } 
is u.d.. 

Observe that Proposition 3.3 applies to any set of k(n) random variables 
= X[o,c[(Vm)) f° r K{n — 1) < m < K{n). It follows that the sequence 

{ipm} is negatively correlated, since the correlation is negative if the random 

variables belong to the same block and is zero if they belong to different 

blocks, because they are independent. 

Let {ch, h G JN} be a dense subset of [0, 1]. Fix h G JN and consider the 

sequence {x^c^^m)}- We may apply to it Proposition 3.2 and get 

1 n 

J™-ExMfe)=ft a. s. 

for any c h . 

But this is a well known sufficient condition for uniform distribution (com- 
pare Exercises 1.1 and 1.3 in [15]). 

4 Final remarks 

The two main results of this paper, Theorem 2.6 and Theorem 3.4, rise several 
questions. 

The first result calls for explicit computation of the discrepancy of the 
sequence {p n oj}. This has been done for a class of these sequences and will 
appear elsewhere. 

For a finite set of points W = {U G [0, 1] : 1 < i < k}, the discrepancy is 
defined by 

Dw = sup 

0<a<6<l 

A natural problem is to estimate the behavior of the sequence {Dw n } 
(where W n is the set defining the partitions p n oo) when n tends to infinity. 
It is easy to see that a sequence of partitions is u.d. if and only if {Dw„} 



Ej=l X[a,b[{ti 



- (6 - a) 
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tends to zero, when n tends to infinity, but it is of particular interest to find 
partitions p that such that the speed of convergence is as high as possible. 
Some classes have been found and will be a subject of a separate paper. 

It is easy to see that if W n = {0, ^, . . . , then {Dw n } = \ and that 

this speed cannot be improved ([16], Chapter 2, Theorem 1.2 and Example 

1-1)- 

On the other hand a classical result says that for u.d. sequences of points, 
the discrepancy behaves, at best, as l2£™_ This has been a long-standing 
open problem solved by W. M. Schmidt [21]. Sequences of points whose 
discrepancy is of this order are said to have low discrepancy. 

There are also several questions connected to Theorem 3.4. 

It would be interesting to compare the discrepancy of {ir n } and that of 
"most" of its sequential random reorderings. In particular, if the discrepancy 
of the sequence of partitions behaves like ^, what is the chance to get a low 
discrepancy sequence of points? 

In the same stream of ideas, given the discrepancy of the sequence of 
partitions {n n }, it would be intresting to estimate the average speed of the 
discrepancy of sequential random reorderings, 

Another interesting problem is to find explicit algorithms to provide 
low discrepancy sequences of points, given a low discrepancy uniformly dis- 
tributed sequence of partitions. Some partial results in this direction have 
been already obtained. 

This list of open problems is non exaustive, but it is long enough to 
confirm Victor Klee's "conjecture" that the ratio between solved problems 
and open questions tends to zero, when time goes to infinity. 
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